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ABSTRACT 

We present the first interferometric ^^CO {J — 1 ^ 0) map of the entire Ha disk of M33. The 
13" diameter synthesized beam corresponds to a linear resolution of 50 pc, sufficient to distinguish 
individual giant molecular clouds (GMCs). Erom these data we generated a catalog of 148 GMCs 
with an expectation that no more than 15 of the sources are spurious. The catalog is complete down 
to CMC masses of 1.5 x 10^ Mq and contains a total mass of 2.3 x 10^ Mq. Single dish observations 
of CO in selected fields imply that our survey detects ^ 50% of the CO flux, hence that the total 
molecular mass of M33 is 4.5 x lO"^ Mq, approximately 2% of the H I mass. The GMCs in our catalog 
are confined largely to the central region (i? < 4 kpc) . They show a remarkable spatial and kinematic 
correlation with overdense H I filaments; the geometry suggests that the formation of GMCs follows 
that of the filaments. The GMCs exhibit a mass spectrum dN/dM oc ji,f-2.6±o.3^ considerably steeper 
than that found in the Milky Way and in the LMC. Combined with the total mass, this steep function 
implies that the GMCs in M33 form with a characteristic mass of ~ 7 x 10'* M©. More than 2/3 of 
the GMCs have associated H II regions, implying that the GMCs have a short quiescent period. Our 
results suggest the rapid assembly of molecular clouds from atomic gas, with prompt onset of massive 
star formation. 
Subject headings: 



1. INTRODUCTION 

\ This paper presents the results of an interferometric 
survey in ^^CO (J = 1 — > 0) of the entire Ha disk 
of M33. Observations were made with the Berkeley- 
lUinois-Maryland Association array. The proximity and 
low inclination of M33, an SA(s)cd spiral in the Lo- 
cal Group, permits direct and detailed comparisons of 

\ molecular, atomic, and ionized hydrogen gas on the 
scale of individual GMCs. The angular resolution of 
the BIMA survey is 13", which corresponds to a linear 

, resolution of 50 pc at the distance of M33 (0.85 Mpc, 

■ iLee. Ereedman. fc Mador3 IT993'l. This resolution is ad- 
equate to separate, but not to resolve, individual giant 
molecular clouds (GMCs) if they are similar in size to 

■ those in the Milkv Wav fe htz 1993). A follow-up pa- 
per by ijRosolowskv. et aiTl2003. Paper II) describes a 
study of rotation properties, virial masses, and morphol- 
ogy of 36 GMCs from this survey which were reobserved 
at BIMA with 20 pc linear resolution. The BIMA ar- 
ray is well-suited for this extensive mapping project for 
three reasons. (1) The 6-meter diameter antennas have a 
large (100" EWHM) field of view at 115 GHz; this keeps 
the number of pointing centers to a manageable level. 
(2) With 10 telescopes, the array affords good snapshot 
imaging capability; visiting each field only a few times 
per night yields a clean svnthesized bea m. (3) The SIS 
receivers l|Engargiola fc Plambeck! [l998j) have excellent 
sensitivity; double sideband receiver temperatures at 115 
GHz are typically 40 K. 

Of the numerous mechanisms proposed for molecular 
cloud formation (Elmegreen 199 0). it is uncertain which, 
if any, is responsible for most GMCs in spiral galaxies. 
Validating any cloud formation model requires an unbi- 
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ased survey of molecular clouds in a galaxy. Prior to the 
present study of M33, there existed no complete catalog 
of giant molecular clouds in an external spiral galaxy. 
Even cloud catalogs in the Milky Way are incomplete 
because velocity blending along the line-of-sight can pre- 
vent unambiguou s separation into individual clouds (e.g. 
Ilssa. MacLaren. fc Woifendalelll990f) . Only in the Large 
Magellenic Cloud, a Local Group irregular, has CO been 
surveyed with sufficient re solution to measur e accurately 
the CMC mass spectrum fEukui et al."200lD. 

A decade ago, Wilson & Scovillc ( 1990) undertook a 
pioneering survey of extragalactic GMCs in the nuclear 
region of M33. With the Owens Valley interferometer 
they mapped ^^CO (J = 1 ^ 0) in 19 fields with a 
1' field-of-view and a 7" synthesized beam. Their data 
analysis yielded a catalog of 38 GMCs with (0.2 — 4) x 
10^ Mq and a mass distribution index similar to that of 
the Milky Way. Although their observations covered a 
significant fraction of the central 2 kpc, they positioned 
most fields to include areas of high optical extinction 
detected in CO emission with the NRAO 12 m telescope, 
possibly biasing the cloud sample. This paper attempts 
to address this bias by presenting a complete survey of 
GMCs in M33 over most of the optical disk. 

2. OBSERVATIONS 

Observations were made with the BIMA array in its 
most compact ("D") configuration at various times be- 
tween 1999 August and 2000 October. As shown in Fig- 
ure n M33 was mosaiced using a grid of 759 pointing 
centers, spaced by 78" in a hexagonal pattern. Circles 
indicate the EWHM primary beamwidth of the BIMA 
antennas, which is 100" at 115.27 GHz. The grid spac- 
ing was chosen to cover the galaxy with as few point- 
ing centers as possible while maintaining a point source 
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Fig. 1. — Fields mapped wi th BIMA toward M33, overlaid on an 
Ha imago ( Cheng et al. 1996) of the galaxy. Circles show the 100" 
FWHM primary beamwidth of the BIMA antennas at 115.27 GHz. 




Fig. 2. — RMS noise level, in Jy beam~^, for the 2 km s~^ 
resolution CO data cube. The white line near the edge of the 
mosaic is the 99% gain contour; outside this contour the data have 
not been fully corrected for the primary beam response. Within 
the 99% gain contour the RMS noise ranges from 0.34 to 0.65 Jy 
beam~^; it is < 0.51 Jy beam~^ within the central half of the 
mosaic. 



sensitivity that varies by no more than 10% across the 
mosaic. 

The C0(1 0) line was observed in the upper side- 
band of the SIS mixers. Single sideband system temper- 
atures, scaled to the top of the atmosphere, were typi- 
cally 300 to 500 K at 115.27 GHz. The correlator cov- 
ered the LSR velocity range —414 to -1-59 km with 
1.015 km s~^ channel spacing, on each of the 45 baselines. 

Each night we cycled through observations of about 50 
grid points, using an integration time of 25 seconds per 
point. The slew and settle time of the telescope between 
integrations was 5 seconds, so a complete loop through 50 
grid points required about 25 minutes. Observations of 
the calibrator, 0136-f 478, were interleaved with complete 
loops. For a typical 6-hour track, each pointing center 
was observed 10-12 times. With 45 baselines, this yields 
about 500 measurements of the source visibility across 
the (m, v) plane, resulting in a nearly round synthesized 
beam with < 10% sidelobes. 

A total of 45 observing tracks was obtained, with du- 
rations of a few hours to 10 hours. The total useful inte- 
gration time accumulated on M33 was 132 hours, which 
corresponds to an average of 10.5 minutes per pointing 
center. 

2.1. Flux Scale 

For each observing track the amplitude gain of each 
antenna was scaled by a constant factor to obtain the 
correct visibility amplitude on the calibrator, 0136-1-478. 
The flux density of this quasar was variable over the 
course of the observations (2.3 Jy in 1999 September, 
3.0 Jy in 2000 May, 3.7 Jy in 2000 July, and 3.4 Jy in 
2000 Oct). We monitored the calibrator flux density by 
making a brief observation of the compact H II region 
W30H once per night. The flux density of W30H was 
measured to be 3.8 ± 0.2 Jy by comparison with primary 
flux calibrators Mars and Uranus. All comparisons with 
W30H were made using the lower sideband continuum 
channel of the receivers at 112 GHz, since measurements 
at 115 GHz are contaminated by CO emission from the 
extended molecular cloud associated with W30H. 



The flux densities derived for 0136-f 478 were generally 
reproducible to ±0.2 Jy from night to night. Allowing for 
possible systematic pointing errors and decorrelation by 
atmospheric phase fluctuations, we estimate that the fl- 
nal amplitude scale is uncertain by approximately ±15%. 

2.2. Mapping 

After calibrating the visibility data, we generated mo- 
saiced spectral line maps using the Miriad software pack- 
age^. The maps were deconvolved with a CLEAN 
algorithm in order to remove negative bowls around 
the strongest sources. The synthesized beam varies 
slightly across the mosaic because individual subgrids 
have slightly different {u,v) coverage. To handle this, 
Miriad creates a cube of synthesized beams, one for each 
pointing center, and interpolates among them when de- 
convolving the map. The maps were restored with a 13" 
Gaussian. 

Most of our analysis was performed on maps with 
2.03 km s~^ velocity resolution. These were produced by 
Hanning smoothing the (1.015 km s^^ spacing) visibil- 
ity spectra, then generating maps of alternate channels. 
The resulting channel maps are expected to be nearly 
statistically independent. 

To evaluate the noise level across the mosaic, we com- 
puted the RMS fluctuations in the spectrum correspond- 
ing to each map pixel, omitting any channels with flux 
densities S > 4a. From a spectrum with 222 statisti- 
cally independent channels, the noise is determined to 
approximately 5% accuracy. We then smoothed the map 
of the noise to 30" resolution (Figure 0). The white 99% 
gain contour denotes the effective outer boundary of the 
mosaic. Outside this contour intensities are not fully cor- 
rected for attenuation by the primary beam. For these 
2 km s~^ resolution channel maps the RMS noise level 
over the central half of the mosaic ranges from 0.34 Jy 
beam^^ to 0.51 Jy beam~^; the average value is 0.44 Jy 
beam^^, or 0.24 K. At the worst spots near the edge of 
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Fig. 3. — RMS noise as a function of Vlsr, for the 2 km s"-"^ 
resolution maps. The noise level is frequency-dependent because 
of gain variations across the instrumental passband. In addition, 
the noise is higher at the blueshifted end of the spectrum because 
of the rapid increase in atmospheric opacity above 115 GHz. 



the mapped region (but inside the 99% gain contour) the 
noise level is sometimes as high as 0.65 Jy beam^^. 

3. THE GMC CATALOG 

3.1. Identifying clouds 

Because the sensitivity varies across the mosaic, we 
searched for molecular clouds by clipping the maps at a 
constant signal-to-noise ratio instead of a constant flux 
density threshold. The intensity data cube is converted 
to a signal-to-noise data cube by dividing through by the 
noise at each position: 



where a{x, y) is the smoothed RMS noise map (Figure[5J 
and /(j^) is a frequency-dependent noise correction (Fig- 
ure which arises because of gain variations of 5 — 10% 
across the instrumental passband. Real CO emission 
occupies only a few of the ~ 20500 independent reso- 
lution elements in each channel map, thus having lit- 
tle effect on the renormalization. In the normalized, or 
signal-to-noise, data cube the intensity is expressed as a 
multiple of the RMS noise at each pixel. For example, 
s{x,y,v) — —3 represents a 3cr detection of a negative 
peak. 

The maximum and minimum normalized fluxes in each 
velocity channel are plotted in Figure^ CO emission is 
obvious in the range —260 < Vlsr < —105 km s^^. A 
histogram of the normalized fluxes in this velocity range 
(Figure is well-fltted by a normal distribution, apart 
from the wing of high signal-to-noise pixels attributable 
to CO detections. 

Clearly, one could comfortably accept as a CO de- 
tection any pixel with normalized flux > 5(7. Unfor- 
tunately, such a simple strategy would miss much real 
molecular emission - Figure |5l shows a clear excess in 
pixels with flux > 3.5(7. Since the velocity maps are 
nearly statistically independent, we can obtain a higher 
detection efflciency by searching for neighboring chan- 
nels with unusually high normalized fluxes. The lowest 
mass molecular cloud which our survey is likely to de- 
tect is of order 2 x lO'' Mg. A typical Milky Way GMC 
with this mass has a velocity dispersion of 1.8 km s~^ 
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Fig. 4. — Maximum and minimum normalized flux (signal-to- 
noise ratio) vs. Vlsr for the M33 CO data cube. Maxima are 
indicated by the histogram, minima by open circles. The channel 
spacing is 2.03 km s~^; the channels have been boxcar averaged 
to obtain a velocity resolution of 4.06 km s~^. Note that this is 
not the CO spectrum that one would observe toward M33 with a 
sensitive single dish; in a large beam the bright CO peaks would 
be greatly diluted by their small filling factors. 
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Fig. 5. — Histogram of normalized flux values in the signal-to- 
noise data cube, for 4 km s~^ channel maps, for the velocity range 
-270 < Vlsr < -100 km s-^ The bin size is 0.025, the total 
number of points is 20916120, and the smooth curve is a Gaussian 
of the form y = 209048 exp(-x2/2). The positive wing is due to 
CO emission from molecular clouds. Since the histogram plots 
^ 12 dependent pixels for each statistically independent beam, the 
scatter in the negative wing of the Gaussian is suppressed compared 
to if all the points were independent. 



(jSolomon et aZ.|[T987j) . which corresponds to a FWHM 
linewidth of 4.2 km s~^. Therefore, we expect that CO 
emission from most real clouds should appear in at least 
two adjacent 2 km s^^ velocity channels. To identify 
molecular cloud candidates, we wrote an IDL routine to 
locate pairs of adjacent channels with normalized flux 
> 2.7(7. Each pair, and all contiguous pixels with nor- 
malized flux > 2(7, are assigned to a candidate molecular 
cloud. These pixels are then masked to avoid double 
counting, and a search for another candidate cloud pro- 
ceeds until all pairs of adjacent > 2.7a channels have 
been masked. The assignment of pixels to candidate 
clouds in this manner does not depend on search order 
but occasionally links nearby sources which could be re- 
garded as separate clouds. Although we expect from our 
map statistics to find real CO emission only in the ve- 
locity range —280 < Vlsr < ~85 km s~^, the search was 
conducted over the full 450 km s~^ range of the data 
cube. A total of 1600 candidate sources were identified 
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in this way. 

To evaluate the Hkehhood that each candidate source is 
real, we examine the most significant spectrum through 
the cloud - either the one with the highest summed sig- 
nificance or the one with the greatest number of adjacent 
channels with flux > la. The calculation described in 
Appendix A evaluates the probability P that this spec- 
trum is a noise fluctuation, i.e. that the detection is spu- 
rious. As discussed in Appendix A, the likelihood that 
the cloud is real is then given by exp(— A^P) ~ 1 — A^P, 
where N is the number of statistically independent ways 
in which n adjacent channels can be selected from the 
data cube. For the full data cube N ~ ribeams ■ nchans ~ 
4.5 X 10^. Note that the likelihood computation is based 
on a single spectrum through each cloud, not on an av- 
erage over the source. This keeps the calculation simple, 
since adjacent spatial pixels oversample the synthesized 
beam and hence are not statistically independent. 

Originally, we intended to catalog all candidate sources 
with < 10% likelihoods of being noise fluctuations. To 
our disappointment, however, the algorithm identifled 
several sources which were almost certainly spurious - 
i.e., well outside the plausible range for CO emission 

- with false detection likelihoods of < 0.001. There 
are several possible explanations for this inconsistency. 
First, there may be outliers caused by a few bad visibil- 
ity measurements, perhaps from observations obtained 
in especially poor weather or at low elevation. Second, 
we may be underestimating the number A'^ of statisti- 
cally independent resolution elements in the data cube. 
Nyquist sampling the highest spatial frequency visibil- 
ity data (corresponding to antenna separations of 11.5 
kA) requires a 9" interval, which would imply there are 
~ 50000 independent resolution elements in the map. 
We believe the actual number is close to our original es- 
timate of 20500 because the long baseline data receive 
little weight in the flnal image. Third, there may be er- 
rors in normalizing the maps by the rms noise. Such 
errors are particularly likely for the frequency dependent 
correction (Figure because there were slow changes in 
the passband gain over the 14 month period in which 
observations were made. 

Because the likelihood calculation is imperfect, we use 
it only to rank the relative likelihoods of the candidate 
sources. For convenience we define a 'likelihood measure' 
as — In(A'^P). Figure plots this quantity vs. LSR ve- 
locity for the 1600 candidate sources. Sources with the 
60 highest likelihood measures (— In(A^P) > 13.4) fall 
in the velocity range —260 < Vlsr < —105 km s~^and 
are apparent in multiple channel maps, i.e. they are un- 
questionably real. As the likelihood threshold is lowered, 
we begin to find a few sources that probably are spuri- 
ous, since they lie well outside this velocity range. At 

— In(A^P) ^ the number of obviously spurious sources 
rises sharply, indicating the noise floor of the data cube. 

For our primary GMC catalog, we accept all cloud can- 
didates in the velocity interval —280 < Vlsr < —85 
km s-i for which -\ii{NP) > 5.83. A total of 93 
sources fit these criteria. It is straightforward to estimate 
the number of false detections in this catalog simply by 
counting the number of sources above the likelihood cut- 
off which lie outside the 96-channel search range. There 
are 3 spurious detections in the 48 channels just below 
the search range, and 3 spurious detections in the 48 
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Fig. 6. — Likelihood measure In(AfP) as a function of Vlsr for 
the 1600 cloud candidates. Large values of — ln(7VP) represent the 
most significant clouds. Only detections within the velocity range 
-280 km s-i < Vlsr < -85 km s"! are included in the catalog. 
The remainder of the velocity range is used to estimate the number 
of false detections at a given significance. Clouds included in the 
main catalog are marked with large dots. 



channels just above the search range. Hence, we expect 
that there are of order 6 spurious detections in this cat- 
alog. 

One can identify fainter sources by narrowing the 
search range to cover only velocities that lie within the 
H I emission line profile in each direction. Figure H 
shows the cloud likelihoods from Figure plotted as 
a function of the H I-CO velocity difference. H I 
velocities were obtained fro m the Westerbork map of 
IDeul fc van der HulstI l|1987j) . which has 12 x 24" reso- 
lution. For the 93 clouds in the primary catalog, the 
velocity difference exceeds cthi for only 5 sources, where 
fiHi is the velocity dispersion of the H I line. The four 
sources with velocity differences > 2(Thi may be false de- 
tections. 

Narrowing the search range to ±2o'hi ~ ±15 km 
from the H I velocity, we identified 55 additional clouds 
with likelihood measure — In(A^P) > 1.4. Figure |H1 shows 
these sources on a likelihood-velocity difference plot. 
Once again, by counting the number of (presumably) 
spurious sources outside the velocity search range, we 
estimate that there are of order 9 false detections in this 
catalog extension. 

It is not useful to catalog individual clouds with like- 
lihoods — In(A^P) < 1.4 because the number of spurious 
sources is quite large. Nevertheless, in Figure |S| there 
is an obvious overdensity of GMC candidates with Vlsr 
within 2a HI of Vhi, and this may be used to gauge the 
number of cloud candidates which are falsely rejected. 
There are 145 candidate sources with 2 <| Vco — Vhi | 
/(Thi < 4 which are presumably spurious. One expects 
to find an equal number of spurious sources in the range 
I Vco — I /fffl < 2. Since there are actually 205 can- 
didate sources in this range, we estimate that roughly 60 
clouds have been falsely rejected from our GMC catalog 
in this significance range. This result is not particularly 
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Fig. 7. — Likelihood measure vs. the CO-H I velocity difference, 
normalized by the H I linewidth. The most significant detections 
are within 3 velocity widths of the H I velocity. 
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Fig. 8. — The 93 clouds in the main catalog are supplemented by 
55 detections selected by their proximity to the H I velocity. There 
is a significant overdensity of clouds near the H I velocity, reflecting 
the presence of real clouds that are indistinguishable from noise. 
Because of the narrow velocity range, we estimate that fewer than 
9 of these clouds are false detections. 



sensitive to how we select boundaries for the overdensity 
band in Figure IS] 

3.2. Cloud Positions, Velocities, and Masses 

Figure El shows the most significant CO spectrum 
through each cloud, overplotted on the H I spectrum in 
the same direction. Table lists the positions and prop- 
erties of the 148 clouds identified in our survey, ranked 
by likelihood. Sources 1-93 were chosen without regard 
to the CO-H I velocity difference, and hence are suitable 
for statistical tests against the H I distribution. 

The position and velocity of each cloud listed in Table 
^ are intensity-weighted averages over all pixels which 
constitute the cloud i.e., x = Tx) / T). The veloc- 



Table 1 

Summary of cloud parameters for HCRO 
observations. 

Tabic omitted for brevity. The tabic is available at 
http : //astro .berkeley. edu/~eros/m33 .table .txt 



ity width is derived from the intensity-weighted second 
moment of the velocity distribution. 



Awfwhm — 



;ln2 



0.5 



(2) 



To estimate the cloud's mass, we convert the CO 
integrated intensity for each pixel to an H2 col- 
umn density 7V(H2) = T{x,y,v)Av ■ X, where 
Av is the channel velocity width, 2.032 km s~^, 

and where we assume a con version factor X = 

2 X 10^° km s~M llSoIomon fi^ oil IToSTL 

Strong et all 119881: IStrong fc Mattoxl Il996t iDigel et all 
199^ The higher resolution observations presented in 



Paper II imply that this conversion factor is consistent 
with the virial masses of clouds out to radii of 4.5 kpc, 
despite variations in metallicity. Each pixel corresponds 
to a physical a rea D^AaAS, where D is the di stance to 
M33, 850 kpc l)Lee. Freedman. fc Mador3ll993l and ref- 
erences therein), and Aa and AS are the extent of the 
pixel in angle, both of which are 4". Thus, the total 
cloud mass is 



M 



pixels 



1.36-2 



rriH ■ X ■ TpixeiAw 



D^AaAS. (3) 



The factor 1.36 is the mass correction for helium with 
a number fraction of 9% relative to hydrogen nuclei. If 
expressed in terms of flux density rather than brightness 
temperature, this is equivalent to 



M = 7.5 X 10^A/o/(Jy km s"^). 



(4) 



This is 25% lower than the conversion factor adopted by 
iWilson fc^ covillc ( 1 99Q) , mostly because of our choice 
of a lower X-factor. 

3.3. Completeness 

To gauge the completeness of the survey, we added 
a regular lattice of artificial test sources to the cube of 
channel maps and computed the fraction of them recov- 
ered by our search algorithm. Each test source was a 
13" X 13" Gaussian, corresponding to a CMC unresolved 
by our synthesized beam, with a Gaussian velocity pro- 
file. The velocity width was kept constant at 6 km s~^ 
FWHM, independent of the flux density (mass) of the 
test source. Artificial sources were spaced by 100" in 
right ascension and declination, over the entire mosaic; 
and by 56.3 km s~^ in velocity, from —345 to —11 km s~^. 
Sources close to the edge of the mosaic were omitted if 
more than 5% of their flux lay outside the mosaic's 99% 
gain contour. In total, 2625 artificial sources were added 
to the data cube. Source recovery tests were done with 
masses ranging from 3 x 10^ Mq to 10^ Mq. 

Figure EH shows the fraction of the test sources de- 
tected as a function of the local RMS noise level. Over 
the full mosaic, the survey is complete to 1.5 x 10^ AIq. 
Within the central half of the mosaic, where the noise 
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Fig. 9. — Spectra for the first 50 clouds in the catalog. The channels included by the cloud identification algorithm are shaded and the 
grey line represents the H I line profile, scaled to have equal amplitudes for the first cloud. 




Fig. 10. — Fraction of artificial clouds recovered as a function 
of RMS noise, for 5 cloud masses. The solid curve shows the cu- 
mulative distribution of noise in our mosaic, within the 99% gain 
contour. Clouds with masses > 1.5 X 10^ Mq will be detected vir- 
tually anywhere in the map; most clouds with mass > 1.0 X 10^ Mq 
will be detected within the inner half of the mosaic. 



is typically < 0.5 Jy beam ^, we recover approximately 
90% of 1 X 10^ Mq clouds. Outside this region, we recover 
70 — 80% of such clouds. 

Figure ITTI shows the recovered mass versus the actual 
mass of the test sources. Our algorithm underestimates 
cloud masses because it integrates the CO luminosity 
only over pixels inside the 2a brightness contour. The un- 
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Fig. 11. — Mass recovered as a function of test cloud mass. 
Inverting this relationship for masses above 5 X 10'* Mq determines 
the correction factor applied to the masses in Table u\ 



derestimates are particularly severe for low mass clouds, 
for which much of the emission is beneath below the 2a 
noise level. For clouds at the threshold of detectability, 
~ 7 X 10* Mq, we recover about half the mass, whereas 
for clouds more massive than 2 x 10^ Mq, we recover more 
than 80%. The average correction factors from Figure ITTI 
were applied to our catalog sources to obtain more accu- 
rate mass estimates; these are given as Mqq in column 
(8) of Table 1. 
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3.4. Extended sources 

The completeness estimates discussed above are ap- 
plicable only to sources that are unresolved by the 13" 
synthesized beam, i.e. to GMCs <C 50 pc in diameter. 
Because an aperture synthesis telescope acts as a spatial 
filter, attenuating low spatial frequencies, the sensitivity 
to larger sources is reduced. Essentially, the synthesized 
beam consists of a narrow positive lobe surrounded by 
a broad negative bowl. If one convolves an extended 
source brightness distribution with this beam, the nega- 
tive contribution from the bowl partially cancels the pos- 
itive contribution from the central lobe, reducing the ob- 
served flux density. For the BIMA D-array, the observed 
flux density for a 40" FWHM Gaussian source is about 
half the true flux density. Thus, our survey may not 
detect a group of molecular clouds with a total mass of 
1 X 10^ Mq spread out over a 40" (150 pc) region, though 
it is quite likely to detect a single GMC with this mass. 
For sources with radii comparable to the beam size, their 
surface brightness will be diminished relative to that of 
a point source by a factor of ~ 2. Fortunately, sources of 
this size are also significantly more massive (10^ Mq, Pa- 
per II) than our completeness limit and would be readily 
detectable. 

Single dish observations of M33 with the University of 
Arizona Steward Observatory 12-m telescope were used 
to estimate the total fraction of the CO flux recovered 
by the BIMA survey. The single dish flux calibrations 
were repeatable within 6%, and agreed with BIMA flux 
measurements of planetary and spectral line calibrators 
within 10%; further observational details are given in Pa- 
per II. Two sets of single dish observations were made. 
First, 18 positions with BIMA catalog sources were ob- 
served. For these fields the BIMA D-array maps recov- 
ered approximately 60% of the single dish CO flux. CO 
was detected in just 2 of the 18 off-source spectra ob- 
tained in these position-switched observations; the 16 
non-detections may be used to set an upper limit of 0.3 
Mq pc~^ for the surface density of an extended (i? < 4 
kpc) uniform molecular disk. Second, we observed 15 
fields to map a 7.5' cut along the major axis of M33. In 
Figure El we show a composite single dish spectrum de- 
rived by shifting the spectrum for each of the 15 fields to 
a common velocity and averaging. The integrated flux of 
the of the 12 m spectrum is 1.2 K km s^^ corresponding 
to ~5.2 Mq pc~^. This corresponds to 1.6 x 10'' Mq of 
molecular gas in the cen tral 7' diameter of the galaxy. 
iWilson fc Scovilld l)1989|) measure 1.9 x lO'^ Mq in the 
same region when converted to our A"-factor, distance, 
and removing their correction for source-beam coupling. 

For comparison between the 12 m and the interferom- 
eter, we show composite spectra made from BIMA data 
within the 12 m fields, convolved to the 54" FWHM 12 m 
beam, sampled at the centers of the 12 m beams, and av- 
eraged together in a fashion identical to the 12 m spec- 
tra. Two BIMA spectra were synthesized, one from the 
catalog source pixels only (thick line), another from the 
data cube clipped at 2a (dotted line). The BIMA catalog 
spectrum contains 43% of the single dish flux, the clipped 
spectrum, 54%. In Paper II we argue that most of the 
missing flux is attributable to complexes of undetected 
low mass clouds associated with catalog sources. 

We can estimate the total molecular mass of M33 from 
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Fig. 12. — A comparison of the flux selected in the D-array sur- 
vey with that observed by the UASO 12-m telescope. A composite 
spectrum (solid line) is shown for 15 single dish fields on a 7.5' 
cut along the major axis of M33. This is derived by shifting the 
spectrum for each of the fields to a common velocity and averaging. 
The integrated intensity of the 12 m spectrum is 1.2 K km s~^ cor- 
responding to ~ 5.2 Mq pc~^. For comparison, composite spectra 
are shown for BIMA data within the 12 m fields, convolved to the 
54" FWHM 12 m beam. Two BIMA spectra were synthesized, one 
from the catalog source pixels only (thick line), another from the 
data cube clipped at 2a (dotted line). The BIMA catalog spectrum 
contains 43% of the single dish flux, the clipped spectrum, 54%. 



the major axis cut. The integrated single dish molecular 
mass in this 7.5' long strip is 2.8 x 10^ Mq. The corre- 
sponding BIMA mass, considering only cataloged clouds 
with masses greater than 1.5 x 10^ Mq, the complete- 
ness limit for our survey, is 9.2 x 10^ Mq, 33% of the 
single dish mass. In this computation we have weighted 
the cloud masses by the appropriate 12-m beam attenu- 
ation factors. The total (corrected) mass in the catalog 
is 2.3 X 10^ Mq, of which 1.5x10^ Mq is from clouds 
more massive than the completeness limit. If the GMC 
mass spectrum in the central cut is similar to that for the 
galaxy as a whole, then we infer that the total molecular 
mass is 3 times this value, or 4.5x10^ Mq. 

3.5. Comparison with Wilson and Scoville 1990 

IWilson Sc^^^ llT99nl hereafter WS90) used the 
Owens Valley array to map 19 flelds in the nucleus of 
M33 with 7" resolution, identifying 38 molecular clouds. 
Figure El is an overlay of the WS90 flelds on the central 
10' X 10' section of the BIMA survey. Red contours show 
the outer boundary of clouds identifled in the BIMA sur- 
vey; solid circles mark locations of clouds cataloged by 
WS90. Frequently, a BIMA cloud corresponds to two or 
more WS clouds - for example, the complex WS1-WS5 
at the center of the galaxy is classifled as one cloud in 
our survey because these sources are connected by low- 
level emission and lower BIMA resolution blends these 
clouds together. The BIMA survey detects all of the 
WS90 clouds with masses > 2 x IO^Mq. 

To examine a few of our clouds more closely, and to 
investigate discrepancies between our catalog and the 
WS90 catalog, we obtained higher resolution BIMA ob- 
servations of the three fields shown by dashed circles in 
FigureEl These fields were observed on 3 nights in 2000 
July with the array in its "C" configuration. The inte- 
gration time was approximately 6 hours per field. The 
C-array observations were calibrated and combined with 
the D-array data to generate maps with a 6.5" synthe- 
sized beam. The RMS noise level in each 2.03 km s~^ 
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Fig. 13. — Comparison of BIMA and WS90 molecular clouds in 
M33. The map covers the central 10' X 10' region of M33. The 
gray scale shows a BIMA D-array image generated by masking 
out all emission < 2a in individual 2.03 km s~^channel maps, 
then integrating over the velocity interval —240 < Vlsr< —128 
km s~^. Red contours show the outer boundaries of BIMA clouds 
determined by our search algorithm. Open blue circles show the 19 
fields mapped by WS90; sources cataloged by WS90 are shown as 
filled blue circles, with area proportional to mass. Magenta circles 
indicate the 3 fields mapped in the BIMA C-array, as described in 
the text. 



channel map was 0.15 Jy beam~^, approximately 0.32 K, 
comparable to that of WS90. The C+D-array maps are 
compared with the D-array survey images in Figure H"!] 

Generally there is good agreement between sources in 
our C+D array images and the WS90 positions, although 
we fail to confirm some of the less massive WS90 clouds 
(e.g., WS21, WS22, WS36). One can find several exam- 
ples showing how the D-array survey blends WS sources 
(e.g., WS27 and WS28 = catalog source no. 17; WS6 -I- 
WS8 = catalog source no. 74). 

Figure [TBI compares the masses of GMCs which can be 
clearly identified in both data sets. As indicated by the 
dashed line, the BIMA masses are expected to be system- 
atically lower by 25% because we used a lower conversion 
factor from flux density to mass. Generally the BIMA 
and WS90 masses are in reasonable agreement, although 
there is a good deal of scatter in the ratio, probably be- 
cause of the difficulty in establishing the outer bound- 
aries of clouds. 

Small clouds detected by the C-array observations are 
not included in our survey catalog (Table^ but a further 
analysis of these clouds is included in Paper II. 

4. CLOUD PROPERTIES 

4.1. GMC Mass Spectrum 

Histograms of the mass distribution of GMCs in our 
catalog are presented on a log-log scale in Figure^l The 
dashed line histogram is the observed spectrum of GMC 
masses uncorrected for signal beneath the 2a clipping 
level, as described in H3.3I The solid line histogram is 
the spectrum of corrected masses. A power law of the 
form 

dM \Mq^ 



was fit to the mass bins for which the catalog is complete 
(M > 1.5 X 10^ Mq). The observed and corrected mass 
distributions are well fit with similar indices (a = 2.6 ± 
0.3) but different normalization constants (K — 1.2 vs. 
1.8 X 101°Mq^). 

For a mass spectrum with a < 2, most of the inte- 
grated mass resides in the most massive clouds. For a 
mass spectrum with a > 2, most of the mass is in the 
least massive clouds, and the total mass diverges when 
extended to arbitrarily small mass. Hence, for M33 there 
must be either a lower mass limit for molecular clouds 
or a break in the distribution where a changes with de- 
creasing cloud mass to less than 2. If the integrated mass 

Mtotai = MdN^ M( — \dM 

'J low limit <J low limit \ "--^^^ / 

(5) 

is distributed according to 



dN 
dM 



^ 1.8 X 101° (^— j [MqI] (6) 



we compute a strict lower mass limit Ml of 4 x lO** Mq 
for a = 2.6 and Mtotai = 4.5 x lO'^ Mq. More likely, 
there is a "turnover" mass, where the mass spectrum be- 
comes flatter than 2. Since we can put a firm lower limit 
on the cutoff mass of 4 x 10'* Mq and an upper limit of 
10^ Mq, where the measured mass distribution is nearly 
complete, the turnover mass of GMCs is ^ 7 x 10^ Mq 
with an uncertainty of 3 x 10* Mq. This turnover repre- 
sents a "characteristic" mass for the molecular clouds in 
M33 that any theory of GMC formation must be able to 
reproduce, given the physical conditions in M33. 

The highest mass we measure for a GMC in M33 is 
7.3 X 10^ Mq; there are two GMCs with this mass. Nei- 
ther is close to the galactic center nor to an especially 
bright H II region. In total, we find five GMCs more 
massive t han the highest mass clo ud (4 x 10^ Mq) re- 
ported bv I Wilson fc Scovilld l)1990() . Moreover, they find 
a lower value of a (1.7), which may have resulted from 
preferentially targeting massive clouds by observing CO- 
bright single dish fields. 

4.1.1. M33, Milky Way, and the LMC 

Over a similar mass range, the power law distribu- 
tion of M33 is steeper than that of the Milky Way 
(1.6, Solomon et al. 1987; 1.9, Heyer, Carpenter, & 
Snell 2001). If M33 were as massive as the Milky Way, 
the normalization constant of its power law distribution 
would be 3.6 x 10" Mg^ and {dN/dM)AM ~ 1.0 
at 1.0 X 10*5 Mq. In contrast, the Milky Way is es- 
timated to ha ve more than 100 GMCs with M > 
1.8 X 10*^ Mq l)Dame et al. Ill986j) . We conclude that 
M33, even though it has m any bright H II regions 
Ulsra el fc van der KruitI I19741) . has significantly fewer 
massive GMCs. As we infer in ij5.6l the major- 
ity of H II regions must be (1) fueled by molecular 
clouds below our completeness threshold or (2) capa- 
ble of efficiently dispersing their parent CMC com plexes 
ijHartmann. Ballesteros-Paredes. fc BergirJ |2001|) . Ex- 
trapolating the observed power law index to a cut-off 
3.0 X lO'* Mq, we find J{dN/dM) dM - 2000. This 
is a sufficient number of molecular clouds to match ap- 
proximately the observed number of H II regions. 
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Fig. 14. — Comparison of BIMA D-array and C+D-array maps of 3 fields. BIMA catalog sources are shown by red dots, WS90 sources 
by blue dots. 



The NANTEN su rvey o f CO in the LMC revealed 168 
GMCs (|Fukui et alJl200H) . The cloud sample has a mass 
range and completeness limit similar to the BIMA sur- 
vey. The mass spectrum of GMCs in the LMC has an 
index of 1.9, similar to that of the outer Milky Way 
ijHever. Carpenter, fc Snelill200H) . indicating that most 
of the mass resides in high mass clouds. The physical 
meaning of the difference between the mass indices of the 
LMC and M33 is unclear. However, as in M33, molec- 
ular clouds in the LM C appear to be tigh tly correlated 
with H I overdensities l|Mizuno et al.ll200lD . and H I fila- 
ments underlie bright complexes of GMCs f H5.3(l . Hence, 
we expect the details of CMC formation to be at least 
somewhat similar for M33 and the LMC. 

4.1.2. Cloud Masses versus Radius 

Figure El shows the distribution of catalog CMC 
masses as a function of galactocentric distance. The dis- 
tribution of clouds with masses greater than our com- 
pleteness limit (1.5x10^ Mq) appears to cut off abruptly 



beyond 4 kpc. For Rgai > 4.5 kpc, we detect only a few 
low mass clouds despite considerable reserves of H I here 
f ii5.ll) . However, higher sensitivity measurements are re- 
quired to determine if we are indeed observing a sharp 
edge to the molecular disk or a shift to the production 
of predominantly low mass clouds. For Rgai < 4.5 kpc, 
the mass distribution appears more or less constant to 
the eye; i.e. the apparent density of points in Figure [T7I 
appears constant with R for a given mass. 

In a more quantitative comparison, we looked for spa- 
tial variations in the CMC mass spectrum by partition- 
ing our catalog into an inner and outer disk sample, 
each containing an equal mass of molecular gas in cata- 
log GMCs. The radius at which this occurs is 2.0 kpc. 
This coarse radial binning assures adequate statistics for 
M > 1.5 X 10^ Mq in two distinct cloud populations. 
Power law fits give a — 2.8 ± 0.3 for the inner disk and 
a = 2.4 ± 0.3 for the outer disk. A 2-sided K-S test 
shows that the two cloud samples are drawn from the 
same population with a 9% likelihood. Though there is 
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Fig. 15. — Comparison of CMC masses from the BIMA D- 
array survey and the WS90 survey, for cases where the same source 
clearly is detected by both surveys. The dotted line shows the 
expected slope, given the different mass conversion factors used in 
the two surveys. 
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Fig. 16.— The mass distribution dN/dM of the CMC catalog. 
Solid line histogram shows the distribution of masses co rrected 
for emission profile truncation at 2a. (Refer to Figure inT ) Dot- 
ted line histogram shows the distribution of uncorrected masses. 
The histograms have been accumulated into half-octave mass bins 
(AlogAf = 0.51og^Q 2), where the zero reference is 10^ Mq on the 
abscissa and 10~^ ^^0^ ordinate. Power law fits show that 

the mass correction has little effect on the slope of the distribution 
for M > 1.5 X 10^ Af0, but increases the normalization coefficient 
30%. 



some evidence for a difference in the mass distribution, 
it is only marginally significant. 

5. SPATIAL COMPARISON OF GMCS, H I STRUCTURE, 
AND H II REGIONS 

5.1. Radial Profiles and Star Formation Rates 

For galactic radii R < 7 kpc, we derived the az- 
imuthally averaged mass surface density from our cata- 
log GMCs. The results are presented in Figure [THl along 
with the surface density of H I mass and Ha flux. Points 
represent averages within deprojected rjidial bins of AR, 
= 0.2 5 kpc. The H I data are from .Cor belli & Salucci 
l)2000tl and the Ha data are deri ved from the optica l 
H II region catalogs compiled bv iHodge. et al.1 l)1999D . 
All three components of the ISM are well fit by an ex- 
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Fig. 17. — Catalog GMC mass (Mqq) as a function of radius. No 
high mass clouds are found beyond 4.5 kpc. Within this boundary 
the mass distribution is roughly independent of radius. Survey cov- 
erage is complete to 5.2 kpc and more than 50% complete between 
5.2 kpc and 8 kpc. Accounting for incomplete coverage should, at 
most, double the number of clouds seen in this range. 



ponential disk = Eq exp(— i?/i?o). Only the fit 

to the molecular data is shown (dot-dash line) in the 
figure. The molecular surface density 5]h2 (squares) de- 
clines exponentially with Rq — 1.4±0.1 kpc, from a peak 
mass surface density 2.15 Mq pc^^. CorbeUi (2003) ob- 
tained a total molecular gas mass of 2 x IO^Mq from an 
FCRAO survey. Accounting for the difference in X fac- 
tors llCorbe lli (2003) used 2.8 x 1020cni-V(K km s"^)), 
her reported mass is 3 times the value we derive. The 
surface mass densities in the central region are compa- 
rable, but the scale length in iCorbcUi (20Q3) is 2.5 kpc, 
resulting in a significantly larger integrated mass. The 
reason for the difference in the derived scale lengths is 
unclear and merits further investigation. The H I sur- 
face brightness (dashed line) is nearly flat for i? < 6 kpc 
with a scale length lOlj kpc and a peak surface density 
of 9.5 Mq pc~^. Ha surface brightness (solid line) has a 
scale length of 1.7 ± 0.2 kpc and a peak surface bright- 
ness of 1.2 X 10'^^ ergs s~^ pc~^, corresponding to a star 
formation rate of 9.5 xlO~^''M0 yr~^ pc~^. 

Kennicutt (1989,1998) has argued that there is a sim- 
ple empirical relation between the SFR and the surface 
density of gas, Sgas, which can be parameterized by a 
power law SFR cx (Egas)". The SFR is most conve- 
niently calculated by scaling the Ha line emission, pow- 
ered by OB stars. Similar scale lengths for Ha and Eh2 
in M33 imply n — 0.9 ± 0.1, consistent with a constant 
SFR p er unit molecular mass. Similarly, Wong fc Blitd 
()2002j) find for a sample of 7 normal spiral galaxies 
that EsFR strongly correlates with Eh2 only, where the 
power law ind ex n ^ 0.8 — 1.4. Integrating Lhq for 
the catalog of 'Hodge, e t alJ ()1999 ^ and using Equation 
2 of Kennicutt (1998), we compute a star formation 
rate of 0.24 Mq yr^^ and a molecular gas depletion 
time of 1.9 x 10* yr for M33. This value is a factor 
of 1.5 - 4 smaller than the depletion times quoted in 
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Fig. 18. — The (deprojected) radial profiles of the molecular, 
atomic, and ionized media. All three are well fit by an exponential 
disk and can be characterized by a peak value and a radial scale 
length. Only the fit to the molecular data is shown (dot-dash line). 
Points represent averages within deprojected radial bins with A_R = 
0.5 kpc. The molecular data (squares) are well fit by an exponential 
disk for ij < 4 kpc with a scale length of 1.4 ± 0.1 kpc and a peak 
mass surface density, Sh2 = 2.15 Mq pc~^; Eh2 drops sharply 
at ii = 4 kpc. The H I surface brightness (dashed line) is nearly 
constant for i? < 6 kpc with a scale length 10.6 ± 2 kpc and a 
peak surface density of Ehi(R = 0) = 9.5 Mqpc~^ Ha surface 
brightness (solid line) has a scale length of 1.7 ± 0.2 kpc and a 
peak surface brightness of 1.2 X lO^^ergs s~^ pc~^. 



IWilson. Scoville. fc Ricd ijlOQlj) who consider individual 
molecule-rich complexes as opposed to t he galaxy as a 
whole. Our results, along with those of 'Wong & Blit^ 
(|2002), are inconsistent with those of Kennicutt (1989), 
who finds for disk-averaged observables that the SFR cor- 
relates more strongly with the surface density of either 
H I or H I+H2 than with that of molecular gas alone. 

5.2. Circular and Radial Motions of Molecular and 
Atomic Hydrogen 

We derived the rotation curve for GMCs cataloged in- 
dependently of Vhi (catalog sources 1 - 93) and for the 
H I emission along the same line-of-sight from the West- 
erbork data cube (Dcul & van dcr Hulst 1987). The ob- 
served source positions {x,y) were deprojected onto the 
plane of the galaxy assuming a P. A. of ~ 21° and an incli- 
nation angle of ~ 51°. These values were varied slightly 
with galactic radius in accordance with the tilted ring 
model of Corbclli & Salucci (2000). The rotation center 
was assumed to coincide with the optical center. At the 
deprojected positions the rotational velocity 
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where 9 is the azimuthal angle of the source measured in 
the plane of the galaxy from the the major axis. Sources 
within 45° of the minor axis were not included because 
of the large correction to {Viosix, y) — Vgys)- 

Figure E| shows the circular velocities for catalog 
GMCs, averaged in 250 pc bins. The median CO ro- 
tation curve (open squar es) closely matches the H I rota- 
tion curve determined bv lCorbelli fc Saluccil l)200(]j) from 
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Fig. 19. — Radial profiles of circular and radial motion 
of primary catalog GMCs, averaged over deprojected annuli of 
Ai? = 250 pc. Model rotation curve of H I also shown (dash). 
To measure the radial motions, only clouds within 30° of the mi- 
nor axis were included. 



Westerbork data. The dispersion of GMCs about the 
H I rotation curve varies between 7.5 and 9.0 km s~^ for 
R < A kpc. There is no apparent trend in the disper- 
sion with galactic radius. These values are comparable to 
Milky Way values after streaming in the Galaxy is taken 
into account. 
Figure [T^ also shows the CO radial velocities 
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including only sources within < 30° of the minor axis. 
We find little evidence for large sca le radial (non-circular) 
motions in M33 in agreement with lDeul fc van der Hulsn 
0987). 

5.3. Spatial Comparison of GMC Distribution with HI 

Filaments 

An extraordinary spatial correspondence between 
GMCs and the distribution of atomic hydrogen is shown 
in Figure 1201 The halftone image is the H I brightness 
distribution, derived by integrating the Westerbork data 
cube (Deul & van der Hulst 1987); the resolution of the 
Westerbork map is 24" x 24", comparable to the BIMA 
data. The H I distribution is dominated by bright, nar- 
row filaments in an extended disk. The dark circles mark 
the positions of the catalog GMCs; cloud masses are pro- 
portional to circle areas. Figure 071 shows that nearly all 
GMCs are found in regions of H I overdensity, but there 
is no correlation between the magnitude of the overden- 
sity and the cloud mass. Apparently, the amount of am- 
bient H I is independent of the mass of the GMC that 
forms from it. The correspondence between GMCs and 
filaments persists down to smaller scales in the central 
region where the H I brightness map has less dynamic 
range. The association with the filaments is even more 
pronounced in the high resolution H I maps of M33 made 
at the VLA ( Thilkcr private communication) . The aver- 
age H I mass located within 150 pc of our catalog sources 
is 5 X 10^ Mq, indicating an ample local supply of atomic 
hydrogen from which observed GMCs could have con- 
densed. 

In Figure 1221 we see that GMCs form from H I with 
an column density threshold of ~ 5 x lO^*' cm^^, which 
is comparable to the mean H I column density in the 
Ha disk (dashed histogram). The shaded histogram 
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Fig. 20. — Color image of H I 21 cm emission from lDeul fc van der HulsH 119871) with catalog sources overlayed. All molecular clouds 
lie in regions of H I overdensity. The area of the molecular cloud has been scaled to represent the relative masses of the clouds. The 
coincidence of molecular clouds with H I overdensity is evidence that clouds form out of the atomic gas. 



indicates pixels with associated CO emission. The 
solidline shows aU WSRT pixels IPeul fc van der Huls^ 
il987tl. The th reshold value is similar to that found by 
iSavaee. et aTl l)1977|) and lBlitz. Bazell fc D esert (1993) 
in the solar vicinity, and is indicated in Figure [221 More- 
over, the H I shows an upper limit in column density of 

3 X 10^^ cm~^. Ay ~ 3, often at positions associated 
with molecular gas. This suggests that the atomic gas is 
providing shielding for the molecular gas, but higher col- 
umn densities result in more gas in the molecular state. 

The close association between GMCs and H I filaments 
can be used to set an upper limit on the GMC lifetimes. 
The RMS difference between the CO and H I velocity 
centroids is 8 km s~^, calculated from clouds 1-93, which 
were selected independent of the H I velocity. The typi- 
cal wi dth of an H I filament in the lDeul fc van der Hulsfl 
l)1987(l map is 200 pc. If cloud lifetimes were signifi- 



cantly longer than 10-20 Myr, clouds would drift off the 
filaments, destroying the observed association. 

5.4. Implications for GMC Formation 

lElmeereei] ((T990|) reviews many theories of GMC for- 
mation, iiichiding_the_ag of small molecular 
clouds ijScoville fc Hershlll979() . However, there can be 
little doubt that the GMCs in M33 form out of the atomic 
gas. If instead the atomic gas resulted from the disso- 
ciation of molecular gas, the H I would appear as dis- 
crete envelopes around the GMCs rather than as fila- 
ments, and there would little atomic gas beyond the ap- 
parent edge of the molecular disk at i? = 4 kpc. Nei- 
ther of these features are seen. In addition, there is 
significantly more atomic gas a factor of 5) in the 
filaments than can be associated with discrete envelopes 
around GMCs and < 15% of this mass can be attributed 
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Fig. 21. — Plot of the mass of a GMC against the H I overdensity 
at the galactic radius of the cloud. The overdensity is defined as the 
H I surface density observed at the position of the cloud divided 
by the azimuthally averaged H I surface density found at that 
radius. The plot shows that there is no relation between these two 
quantities. Note that a few low mass clouds are found in regions of 
H I underdensities. These may be among the clouds in the catalog 
that are not real. 
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Fig. 22. — Histogram of H I column densities in the WSRT 
map of M33 IDeul L van der Huls3 119871) . The solid histogram 
shows the column densities for pixels through the entire map. The 
dashed histogram represents pixels within 6.7 kpc of the center 
(the Ha disk). The shaded histogram represents pixels associated 
with CO emission, that is pixels within the 2cr boundaries of our 
148 cataloged clouds. The dotted vertical line at 5 X lO'^" cm~^ is 
the thre shold column d ensity for molecule formation in the solar 
vicinity ISavage. et al. iflQTWl . 



to photodissociation ijWilson fc ScovilI3ll99lD . Further- 
more, if GMCs formed by the colhsional aggfomeration 
of smaUer molecular clouds, the growth thiie would be 
comparable to the collision time. This is roughly equal 
to the collision time of the catalog clouds, which is 2 x 10^ 



years based on 140 GMCs inside a radius of 4 kpc, an 
H2 half thickness of 100 pc and a cloud-cloud veloc- 
ity dispersion of 10 km s~^. Since the number density 
of clouds scales roughly as l/Mdoud and the geometric 
cross section scales as Mdoud (owing to GMCs having 
a constant surface density, see Paper II), this collision 
time is independent of mass and characterizes the growth 
time for all self-gravitating clouds. Gravitation effects 
do not significantlv enhance the collision cross section 
llScoviUe fc HeTshl Il979t IBlitz fc Shul IToSf? ) . Addition- 
ally, Paper II sets an upper limit for the diffuse II2 sur- 
face density of 0.3 Mq pc~^ Forming a 10^ Mq GMC 
from a diffuse molecular component at this surface den- 
sity would require almost 10^ yr to accumulate material 
from a 300 pc radius region, provided that the gas ve- 
locity dispersion could be converted into ordered motion. 
Both times are considerably longer than the lifetimes of 
the GMCs ('--^ 2 X 10^ yr) based on their association with 
the H I filaments. 

It appears that the process of GMC formation in M33 
is first and foremost a process of H I filament formation. 
While a high H I column density is a necessary condition 
for the formation of a GMC, it clearly is not a suffi- 
cient condition, since there are many positions with high 
H I column densities but no observed CO emission (Fig- 
ure I22|l . The filamentary structures extend out to the 
R — 8 kpc edge of the H I map, well beyond the radius 
where GMCs are observed, with no significant change in 
character at the edge of the molecular disk. It appears 
that some physical parameter, which evidently is a func- 
tion of galactic radius, determines what fraction of the 
atomic gas in a filament is converted into molecules. A 
likely candidate is hydrostatic pressure in the disk, a hy- 
pothesis that will be investigated in a subsequent paper. 

5.5. Are GMCs and HI Holes Physically Linked? 

Expanding shells in the interstellar medium may 
play an important role in the dynamics of the neu- 
tral gas, transmitting both mechanical and radian t 
energy at their boundaries ('M cKee fc Ostrikeil Il977j) . 
IDeul fc den Hartog (1990, DdH) cataloged 148 holes 
(underdensities) in the H I distribution of M33. Ninety- 
three of the holes (Types 2 and 3 in DdH) show the kine- 
matic evidence of expansion in the Westerbork channel 
maps. These holes cover < 20% of the neutral disk area. 
DdH find that many of the compact holes have OB asso- 
ciations inside and conclude that many such holes have 
been excavated by massive star formation activity. While 
DdH also allow for the possibility that holes are formed 
by the local collapse of H I to form GMCs, our CO sur- 
vey shows that this is unlikely, since nearly all GMCs are 
outside the holes. In this section we consider evidence of 
a physical connection between GMCs and the H I shells. 

There is, in fact, a tendency for the catalog GMCs to 
be spatially clustered around the DdH H I holes. We 
quantified this effect by measuring the fraction of GMCs 
located within annuli of width AR bounding the holes, 
and comparing this to the fraction expected if the GMCs 
and holes are uncorrelated. We restricted the analysis to 
holes with diameters less than 400 pc because these are 
more likely to have been created by high mass star for- 
mation. The results are shown in Figure [23 We find a 
significant correlation between GMCs and H I hole edges 
for AR between 60 and 300 pc, most significantly for AR 
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Fig. 23. — Positional correlation of cat alog GMCs with H I holes 
cataloged by IDeul Ik den Hartoel 119901) . Open circle denote the 
fraction of clouds which overlap an annulus of width AR bounding 
some hole with diameter < 400 pc. The thick curve denotes the 
fraction of overlap expected for randomly distributed clouds. The 
thin curve is the binomial probability for the observed deviation, 
which is < 0.05 for annular widths of 60 — 300 pc. The minimum 
probability 0.003, marked by the dashed line, occurs at an annular 
width of 150 pc. 




Fig. 24. — HI holes and associated GMCs. Shown are 2ct 
boundaries (filled) of catalog GMCs and 150 pc annuli (shaded 
areas) around DdH hole edges. From this figure it can be seen that 
no significant CO emission extends into the DdH holes. The large 
separation (150 pc) between hole edges and GMCs suggests that 
expanding shells do not directly trigger formation of the catalog 
GMCs. An extraordinary hole-CMC association (labeled DdH 28) 
may be evidence of a high velocity cloud impacting the H I disk. 



= 150 pc (binomial probability Prand ~ 10~^). The large 
separation (150 pc) between hole edges and GMCs sug- 
gests, however, that expanding shells do not directly trig- 
ger formation of the GMCs. Figure El plots the GMCs 
(boundaries defined by the 2a clipping levels) and the 
150 pc annuli (shaded) around the (compact) hole edges. 
This figure shows that there is no significant CO emission 
inside the DdH holes. 

Since essentially all GMCs are on filaments but not 
all GMCs are at the edges of holes, we conclude 



that the filamentary structure is more fundamental to 
CMC formation than the action of expanding shells. 
These shells may create filamentary structure on small 
scales, especially in the presence of turbule nce (e.g. 
iHartmann. Balleste ros-Paredes. fc Bcrgin l200lD . but are 
unlikely to produce the largest filaments. The energies 
required to evacuate the large holes defining these fil- 
aments are > 10^'^ ergs (see e.g. DdH). Such energies 
would leave bright stellar remnants at the centers of the 
large holes, which are not generally observed. In addi- 
tion, these holes would be sheared apart on time scales 
of 2 X 10^ years by differential galactic rotation unless 
they were stabilized by some process. Thus some orga- 
nizing principle other than stellar feedback seems to be 
responsible for much of the filamentary H I structure, 
and ultimately, for CMC formation in M33. One possi- 
bility is MHD instabil ities in a self-gravitating, shea ring 
disk l|Elmegreenlll987HKim. Ostriker. fc Stoneil2002() . 

Although nearly all GMCs are outside holes on fila- 
ments of H I, an exception is DdH 28 (Figure Ej). The 
catalog GMCs in this hole may result from a high velocity 
cloud (HVC) colliding with the H I disk. This scenario 
for hole creation was first elaborated by iTenorio-Tagld 
(1981). GMCs 66 and 88 have velocities blue-shifted rel- 
ative to local H I by 72 and 24 km s""'^, respectively. The 
formation of GMCs 40, 51, 54, 123, and 133 at the edge of 
the hole - all blucshiftcd relative to local H I- may have 
been triggered by this collision as well. The combined ki- 
netic energy of these clouds (1.4 x 10^^ ergs) could have 
been supplied by an atomic HVC. Alternatively, GMCs 
in DdH 28 may have formed on the front surface of neu- 
tral shells. That is, they may be similar to the Orion 
A and B molecular clouds, which appear to have formed 
from gas lifted 14 pc out of the plane of the Milky Way 
by stellar winds fDame. Hartmann. fc Thad deuj l2001t 
iHartmann. Ballesteros-Paredes. & Bcrgin 200f^ 

5.6. Spatial Comparison of CMC Distribution with H II 

Regions 

Figure |55l compares the locations of GMCs with Ha 
emission, showing that some GMCs follow the weak 
Ha spiral arms of M33. Molecular clouds in the 
overlay are represented by filled circles with area pro- 
portional to mass; the u n derlyin g halftone is a CCD 
image from 'Chen g et al.l l)1996D . Most of the Ha 
flux c omes from H II regio ns, which are cataloged 
in B oulesteix et alJ llT9 74'). W vder. Hodge, fc Skeltoi] 
(119971) ■ and lHodge. et al.. (J^999). As shown in gEIl the 
radial profiles of CO flux and H II region flux have com- 
parable scale lengths; however, not all H II regions are 
associated with GMCs. In fact, there are over 20 times 
as many H II regions as catalog GMCs, and many of 
the H II regions are far from detected clouds. Since our 
GMC catalog has a lower dynamic range than the H II 
region catalog, it's possible that low mass GMCs are as- 
sociated with most of these "orphan" H II regions; ti4.1l 
argues that there could be ~ 2000 molecular clouds in 
M33, comparable to the number of H II regions. Al- 
ternatively, H II regions may rapidly dissipate molecu- 
lar clouds, accounting for the discrepancy in numbers 
(Hartmann. Ballesteros-Paredes. & Bcrgin 2001). 

We calculate the fraction of the Ha flux that orig- 
inates from H II regions that spatially overlap cata- 
log GMCs. Using the positions and radii of H II re- 
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Fig. 25. — Location of GMCs plotted with blue circles on the Ha image of ICheng et'all {199^ . The area of circles is proportional to 
the molecular mass. 



gions from the combined catalogs ('Boulesteb^^a ^l974l: 
IWvder, Hodge. & Skelton 1997: Hod ge, at al.lll999D . we 

find that 40% of the Ha flux originates from H II re- 
gions which are tangent to, or contain, catalog GMCs. If 
the remaining 2.2 x 10^ Mq of molecular gas f ij3.4|l pro- 
duced a proportional amount of Ha flux, at least 80% 
of the emission would arise from H II regions associated 
with GMCs. 

One may also ask what fraction of the GMCs are ac- 
tively forming stars. Figure ESI shows the positional cor- 
relation of catalog GMCs with H II regions (circles) . We 
counted the fraction of GMCs that have at least one H II 
region within distance Ar. The thin curve is the frac- 
tion expected from random association (see Appendix 
B). There is a statistically significant clustering of GMCs 
and H II regions out to a separation of 150 pc. As many 
as 100 GMCs (67%) are apparently forming massive stars 
- that is, their centroid position is within 50 pc of some 
H II region. This is a similar fract ion of association 
as found in a more limited study bv iWilson fc Scovilld 



1IT99I . Of the remaining GMCs, we estimate that only 
a few contain obscured Ha sources. From our correla- 
tion curve, there are only 7 H II regions coincident with 
a GMC position (Ar < 10 pc). By symmetry, we ex- 
pect that only ~ 7 catalog GMCs are obscuring signs of 
massive star formation. 

Winds and ionizing radiation from massive stars de- 
stroy molecular clouds. Since at least 2/3 of our sources 
are associated with H II regions, we infer that GMCs 
spend less than 1/3 of their life in a quiescent phase prior 
to the onset of massive star formation. The fraction of 
time spent in the quiescent phase appears to be even 
smaller for the more massive clouds. Figure 1261 shows 
that 85% of the higher mass GMCs are within 50 pc of 
H II regions versus 55% of the lower mass GMCs. A re- 
ciprocal correlation exists between luminous H II regions 
and GMCs: luminous H II regions (top 10%) are nearly 
twice as likely to have an associated catalog GMC than 
the population as a whole. 

Between the cutoff of the molecular disk {R — 4.5 kpc, 
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Fig. 26. — Figure 1261 shows the positional correlation of 
catalog GMCs with H II regions. The H II region posi- 
tions are from the combin ed catalogs of Boulcstcix ct al. (1974), 
IWvder. Hodge. &: Skeltoiil iT99711 . and .Hodge, et al... (1999). Wc 
measured the fraction of GMCs with at least one H II region within 
a separation Ar. The thin curve is the fraction expected from 
random association. There is statistically significant clustering of 
GMCs and H II regions out to a separation of 150 pc. As many 
as 100 GMCs (67 %) are apparently forming massive stars — that 
is, their centroid position is within 50 pc of some H II region. 85% 
of the higher mass GMCs (triangles) are within 50 pc of H II re- 
gions versus 55% of the lower mass GMCs (circles). This result is 
consistent with a shorter quiescent time for higher mass clouds. 

H5.1|l and the edge of the star forming disk (i? = 6.7 kpc), 
there are 7 cataloged GMCs and 600 H II regions. Com- 
paring the total Ha luminosity in this region to the total 
cataloged molecular mass, we find the ratio Lua/ AfH2 
for this region is 8 times larger than in the inner galaxy 
{R < 4.5 kpc). Correcting for different sensitivities and 
survey coverage in the two regions can only account for 
a factor of two discrepancy. Based on the Ha flux, we 
would expect to have cataloged at least 4 times as much 
molecular mass in the outer galaxy as we actually do. It 
is unlikely that a burst of star formation in this region has 
depleted the supply of GMCs. We conclude either that 
dispersal times for GMCs are significantly shorter in the 
outer galaxy, that the mass spectrum of the GMCs be- 
comes even steeper there, or the metallicity significantly 
affects the CO-to-H2 conversion factor in this region. 

6. SUMMARY 

In this paper we present the results of an unbiased in- 
terfcrometric survey of the star forming disk of M33 in 
C0(1— >0); the observations were done using the BIMA 
array. The 50 pc linear resolution of our survey map is 
comparable to the size of most GMCs. We derive a cat- 
alog of GMCs for M33 complete down to 1.5 xlO^ Mq. 
From simple statistics, we expect that no more than 15 
of the 148 sources listed in our catalog are spurious. We 
estimate that approximately 60 clouds with a total mass 
of 4.3 X 10^ Mq have been falsely rejected from the cat- 
alog. The interferometer data were compared with sin- 
gle dish fields observed at the UASO 12 m telescope to 
estimate the effects of spatial filtering and masking on 
the catalog. Our results generally agree with those of 



iWilson fc Scovillj l)199Qjl . but the greater spatial cover- 
age and flux completeness of the BIMA survey leads, in 
some cases, to different physical inferences. 
From the survey data, we conclude the following: 

1. The total mass of GMCs in our catalog is 2.3 x 
lO'^ Mq. From a comparison of the interferometer and 
single dish data in selected fields, we estimate that the 
total molecular mass of the galaxy is 4.5 x 10'' Mq, 2% 
of the atomic mass. 

2. GMCs with masses above our completeness limit are 
described by a power-law distribution with dN/dM oc 
M-2-6±0-3. The power-law index of GMCs in M33 is 
steeper than that in the Milky Way {dN/dM cx M'^-^). 
We infer that there exists a low mass cutoff or a change 
in the index to < 2 between 3 x lO'* Mq and 10^ Mq. 
The cutoff or change in slope implies that GMCs in M33 
form with a characteristic mass of ~ 7 x 10"* Mq. 

3. The surface density of molecular gas decreases ex- 
ponentially in radius, with a scale length of 1.4 ±0.1 kpc. 
The scale length agrees well with the surface brightness 
of Ha emission (1.7±0.2 kpc) implying a scaling between 
star formation and molecular gas surface density of SFR 
oc E^Q^"'^. Based on the Ha luminosity of M33, we es- 
timate a star formation rate of 0.24 Mq yr~^, implying 
a molecular gas depletion time of 1.9 x 10* yr. This very 
short time is not unreasonable given the large reservoir 
of atomic gas available at all radii. 

4. The rotation curve of the galaxy derived from CO 
emission is in excellent agreement with that of the H I. 
We find no evidence for large scale radial motions of 
molecular gas in the galaxy. 

5. Giant molecular clouds are preferentially found on 
bright H I filaments where the H I column is approxi- 
mately 10^^ cm~^. Regions with H I column densities 
< 5 X 10^° cm~^ are devoid of catalog GMCs. 

6. We estimate a CMC lifetime of 10-20 Myr based on 
the close association between GMCs and H I filaments. 
Given the typical velocity difference of 8 km s~^, the 
clouds would drift away from the filaments on longer time 
scales. 

7. Since the filamentary structure of the H I gas 
appears to be independent of the existence of nearby 
GMCs, we conclude that it is the H I filamentary struc- 
ture that forms first. The fraction of gas that becomes 
molecular is then determined by some other parameter, 
such as hydrostatic pressure, that is inversely correlated 
with galactic radius. 

8. At least 40% of the Ha fiux is associated with cata- 
log GMCs. If the undetected molecular mass produces a 
proportional amount of Ha flux, at least 80% of the emis- 
sion would arise from H II regions associated with GMCs. 
At least 2/3 of catalog GMCs are within 50 pc of an H II 
region, so it appears that GMCs spend less than 1/3 of 
their lifetime in a quiescent phase prior to the onset of 
star formation. Clouds above the median mass spend 
less than 15% of their lifetime in the quiescent phase. 
H II regions above the 90th percentile in luminosity are 
nearly twice as likely to have a CMC within 50 pc as 
compared to the entire population. 

9. Nearly all catalog GMCs are exterior to H I holes. 
Clearly, H I holes do not result from conversion of atomic 
to molecular gas. GMCs and compact H I holes ( d < 
400 pc) are clustered over separation scales of 60 — 300 
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pc. However, since many GMCs are not associated with 
holes, the holes probably play only a secondary role in 
GMC formation. 

10. GMCs 66 & 88 are the only GMCs interior to 
an H I hole (DdH 28); they are highly blueshifted with 
respect to H I. GMCs 40, 51, 54, 123 & 133, located along 
the edge of DdH 28, are also blueshifted with respect to 
H I. Conceivably these clouds were formed by the impact 
of a high velocity cloud on the atomic disk. 

11. Although the molecular disk shows a sharp decline 
in the number of GMCs beyond R ~ 4.5 kpc, the Ha 
disk extends to R 6.7 kpc. This suggests that, for 



the outer disk, either dispersal times of GMCs are sig- 
nificantly shorter than for the inner disk or that there 
exists a large population of GMCs below our detection 
threshold. 



This work was partially supported by NSF grant AST- 
9981308 to the University of California. This research 
made extensive use of NASA's Astrophysics Data Sys- 
tem (ADS) and the NASA/IPAC Extragalactic Database 
(NED). ER's work is supported in part by a NSF Grad- 
uate Fellowship. 



REFERENCES 



Blitz, L. & Shu, F. H. 1980, ApJ, 238, 148 

Blitz, L. 1993, in Protostaxs and Planets III, 125-161 

Blitz, L., Bazell, D., & Desert, F.X. 1990 ApJ, 352, L13 

Boulesteix, J., Courtes, G., Laval, A., Monnet, G., & Petit, H. 

1974, A&A, 37, 33 
Cheng, K.P., Hintzen, P., Smith, E.P., Angione, R., Talbert, F., 

Collins, N., Stecher, T. 1996, BAAS, 28, 904 
CorbcUi, E. 2003, MNRAS, 342, 199 
CorbcUi, E. & Salucci, P. 2000, MNRAS, 311, 441 
Dame, T. M., Elmegreen, B.C., Cohen, R.S., & Thaddeus, P. 2001, 

ApJ, 305, 892 

Dame, T. M., Hartmann, D., & Thaddeus, P. 2001, ApJ, 547, 792 
Dcul, E. R. & den Hartog, R. H. 1990, A&A, 229, 362 (DdH) 
Dcul, E. R., & van der Hulst, J. M. 1987, A&AS, 67, 509 
Digel, S. W., Aprile, E., Hunter, S. D., Mukherjee, R., & Xu, F. 

1999, ApJ, 520, 196 
Elmegreen, B. G. 1987, ApJ, 312, 626 

Elmegreen, B. G. 1990, ASP Conf. Ser. 12: The Evolution of the 

Interstellar Medium, 247 
Engargiola, G. & Plambeck, R. L. 1998, Proc. SPIE, 3357, 508 
Fukui, Y., Mizuno, N., Yamaguclii, R., Mizuno, A., & Onishi, T. 

2001, PASJ, 53, L41 
Hartmann, L., Ballestcros-Paredes, J., & Bergin, E. A. 2001, ApJ, 

562, 852 

Heyer, M. H., Carpenter, J. M., & Snell, R. L. 2001, ApJ, 551, 852 
Hodge, P. W., Balslcy, J., Wyder, T. K., & Skelton, B. P. 1999, 

PASP, 111, 685 
Israel, F. P. & van der Kruit, P. C. 1974, A&A, 32, 363 
Issa, M., MacLaren, I., & Wolfendale, A. W. 1990, ApJ, 352, 132 
Kennicutt, R. C. 1998, ApJ, 498, 541 
Kennicutt, R. C. 1989, ApJ, 344, 685 

Kim, W., Ostriker, E. C, & Stone, J. M. 2002, ApJ, 581, 1080 
Klessen, R. S., Burkert, A., & Bate, M. R. 1998, ApJ, 501, L205 



Lee, M. G., Preedman, W. L., & Madore, B. F. 1993, ApJ, 417, 
553 

McKee, C. F. & Ostriker, J. P. 1977, ApJ, 218, 148 

Mizuno, N. et al. 2001, PASJ, 53, 971 

Gort, J. H. 1954, Bull. Astron. Inst. Netherlands, 12, 177 

Rosolowsky, E., Plambeck, R., Engargiola, G., & BUtz, L. 2003, 

ApJ, Accepted. (Paper II) 
Sanders, D. B., Scoville, N. Z., & Solomon, P. M. 1985, ApJ, 289, 

373 

Savage, B. D., Drake, J. F., Budich, W., & Bohlin, R. C. 1977, 

ApJ, 216, 291 
Scoville, N. Z. & Hcrsh, K. 1979, ApJ, 229, 578 
Solomon, P. M., Rivolo, A. R., Barrett, J., & Yahil, A. 1987, ApJ, 

319, 730 

Strong, A. W. & Mattox, J. R. 1996, A&A, 308, L21 
Strong, A. W. ct al. 1988, A&A, 207, 1 

Stutzki, J. 1999, Plasma Turbulence and Energetic Particles in 
Astrophysics, Proceedings of the International Conference, 
Cracow(Poland), 5-10 September 1999, Eds.: Michal Ostrowski, 
Reinliard Schlickciser, Obscrwatorium Astronomiczne, 
Uniwersytct Jagiellonski, Krakow 1999, p. 48-60., 48 

Tenorio-Taglc, G. 1981, A&A, 94, 338 

Thilkcr, D. 2003, private communication. 

Wilson, C. D. & Scoville, N. 1989, ApJ, 347, 743 

Wilson, C. D. & Scoville, N. 1990, ApJ, 363, 435 

Wilson, C. D. & Scoville, N. 1991, ApJ, 370, 184 

Wilson, C. D., Scoville, N., & Rice, W. 1991, AJ, 101, 1293 

Wong, T. & Blitz, L. 2002, ApJ, 569, 157 

Wyder, T. K., Hodge, P. W., & Skelton, B. P. 1997, PASP, 109, 
927 



18 



APPENDIX 

PROBABILITY CALCULATION 

This appendix provides a brief description of the calculation used to rank the likelihood of candidate molecular 
clouds in the data cube. 

As discussed in the text, the flux density S at each pixel in the data cube is first normalized by the RMS noise a, 
as a function of position and velocity, to produce a signal-to-noise, or significance, image y: 

si.,y,v) = ^^^^. (Al) 
a{x,y)a(v) 

Suppose that at some location in the data cube we have identified a spectral feature, occupying n adjacent velocity 
channels, which we suspect may be CO emission from a molecular cloud. What is the likelihood that this is a false 
detection? 

Assume that the signal-to- noise ratios of the n channels have been sorted in ascending order such that si < S2 < 
. . . < s„ . We wish to calculate the probability of randomly drawing n values from a normalized Gaussian distribution 
(with zero mean and standard deviation 1), such that all n values are > si, at least (n — 1) values are > S2, ■ ■ ■ , and 
at least 1 value is > s„ - that is, the probability of selecting n values which are at least as unlikely as the actual 
spectrum. 

Suppose that one draws a single value s from the Gaussian distribution. There are (n -I- 1) mutually exclusive 
outcomes: 

s < si, with probability po — 0.5{1 — erfc (si/\/2)} 

s > Sn, with probability Pn = 0.5{erfc (s„/ \/2)} 
where 

erfc(x) = ^ / e~*^dt. (A2) 

Jx 

Then, if one draws a total of n values from the distribution, the probability that kg have values s < Si, fci have 
values in the interval si < s < S2, ■ ■ ■ , and that s„ have values s > yn, is given by the multinomial distribution: 

p{ko, fci, . . . , /c„) = ^^i"" _ ^ , Po>i' ■■■Pn"- (A3) 

The probability of a false detection is the sum of all the multinomial terms which fulfill the original requirement, 
namely that fco = 0, fc„ > 1, (fc„_i -I- > 2, and so on: 

Pfaise = p{0, 0, . . . , n) + p(0, 0, . . . , 1, n - 1) + . . . + p(0, 1, . . . , 1). 

The M33 data cube contains approximately 20500 independent positions, each of which corresponds to a 220-channel 
spectrum. For an A'^c/i-channel spectrum, there are (Nch ~n+l) ways of selecting n adjacent channels. Thus, over the 
94 channel range covering the velocity interval —290 < Vlsr< —80 there are of order Ntriais = 20500 (94— n) ~ 2 x 10^ 
ways of selecting n adjacent channels from the data cube. If Pfaise <C 1 and Ntriais ^ 1, the probability of r false 
detections is given by Poisson statistics: 

P(^) = (^[fLe-^P. (A4) 
r! 

The probability of any given detection being real is Preai = P{r = 0) = e~^^ ~ 1 — NP for NP ^ 1. If there are a 
given number of detections at this significance level, the probability of q of them being false is simply P{q). 

As an example, suppose we detect 3 adjacent channels with si = S2 — S3 ~ 2.5 a. In this case the probability of 
a false detection is simply the product of the probabilities of a false detection in each of the 3 channels: Pfaise = 
{G.5erfc(2.5/V2)}3 = 2.39 x 10"'^. Using the Poisson formula, one finds a 60% likelihood of all sources at this 
significance being real. 

If the signal-to-noise ratios of the 3 channels are unequal, then the probability of a false detection is greater than the 
simple product of the 3 probabilities, essentially because one has more than one chance to draw the less likely values 
from the population of Gaussian deviates. For example, if si = 2.0 u, S2 = 2.5 cr, and S3 = 3.4 cr, 

Pfaise = p{0, 0, 0, 3) + p{0, 0, 1, 2) + p{0, 0, 2, 1) + p{0, 1, 0, 2) + p{0, 1, 1, 1) = 2.39 x 10"^ (A5) 

whereas the simple product of the probabilities is 4.76 x 10~*. Again, Poisson statistics calculates about a 60% 
likelihood that all sources at this level are real. Because of the non-Gaussian nature of the noise, we ultimately use 
this probability as a relative measure for ranking the significance of cloud candidates fi j3.1|l . 

GMC-H II REGION SPATIAL CORRELATION FUNCTION 

To understand in a statistical sense how massive star formation affects GMCs, it is a matter of interest to determine 
how many GMCs in our catalog are physically linked with Ha emitting H II regions. The filling factors / for Ha and 
CO in the survey region are 2.4% and 0.41%, respectively, where both types of emission show strong positional 
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correspondence with H I filaments. H II regions are so widespread that, to some degree, observed proximity to GMCs 
may be random. If H II regions and GMCs are all randomly distributed in a survey area Agurvey, the average separation 
(r) between a GMC and the nearest H II region obeys the relation 



(Ami) 



(Bl) 



where {Ami), the average area of an optical H II region, is 700 pc^. If we assume an H II region and GMC are 
closely associated if they are separated by no more than ro , say, then the expectation value for the number of close 
associations is 



'■^,A,,„rr,!j.f-Hafco 



Assuming that (Aco) has approximately the same area as a resolution element of our survey, (^hii)/(^co) ^ 0.4, 
and (ro) ~ 40 pc - the average projected separation of an H II region lying at the edge of a GMC - then N^xp ~ 34. 
A comparison of our GMC catalog to the H II region catalogs previously referenced yields 100 H II regions in close 
association with GMCs, about a factor of three in excess of what might be expected if GMC and H II positions were 
random. 



